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The complete nucleotide sequence of the ns2 gene of 
human coronavirus OC43 (HCV-OC43) was deter- 
mined. Sequence analysis revealed an open reading 
frame that could encode a protein of 278 amino acids, 
with an estimated molecular mass of 32:2 kDa. Six 
potential phosphorylation sites are present but no sites 
of N-glycosylation were found. The amino acid sequence 
of the HCV-OC43 ns2 protein shows 92 % identity with 


that of the Mebus strain of bovine coronavirus (BCV). 
However, a stretch of nine consecutive amino acids near 
the C terminus is completely different, causing it to be 
very hydrophilic, which contrasts with the hydrophobic 
nature of this region in BCV. As shown by immuno- 
fluorescence with a monospecific antiserum, the ns2 
protein was expressed in the cytoplasm of HCV-0C43- 
infected HRT-18 cells. 


Coronaviruses are enveloped viruses that contain a 
single-stranded positive sense RNA genome of 27 to 
31 kb (Boursnell et al., 1987; Lee et al., 1991). The 
genomic RNA encodes six to eight capped and poly- 
adenylated subgenomic mRNAs that are arranged in a 
3’-coterminal nested set structure. Each mRNA possesses 
acommon 5’ end leader sequence derived from the 5’ end 
of the genomic RNA. It has been suggested that the 
interaction of the 3’ end of the leader sequence with the 
full-length minus strand genomic RNA, at the consensus 
intergenic sequences, initiates the synthesis of sub- 
genomic mRNAs of diverse lengths (Lai, 1990). 
Human coronaviruses (HCV) have so far been repre- 
sented by two prototype strains, OC43 and 229E, which 
belong to two distinct antigenic groups. HCV-OC43 
shares antigenicity with bovine coronavirus (BCV), 
haemagglutinating encephalomyelitis virus of swine 
(HEV) and murine hepatitis virus (MHV). HCV-229E is 
antigenically related to porcine transmissible gastroen- 
teritis virus (TGEV) and canine coronavirus (CCV). 
Human coronaviruses are responsible for 15 to 35% of 
common colds (McIntosh, 1974; Myint, 1994; Wege et 
al., 1982) and have been associated with severe diarrhoea 
in the newborn (Resta et al., 1985). Their involvement in 
neurologic diseases such as multiple sclerosis has also 
been suggested (Murray et al., 1992; Stewart et al., 1992). 
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Human coronavirus OC43 comprises four or five 
major structural proteins: a peplomer (S) glycoprotein, 
an haemagglutinin-esterase (HE) glycoprotein, a nucleo- 
capsid (N) phosphoprotein and a membrane (M) 
glycoprotein (Mounir & Talbot, 1992, 1993a; Zhang et 
al., 1992; Kamahora et al., 1989), as well as a predicted 
small membrane protein (sM) (Mounir & Talbot, 19935), 
apparently similar to the one previously identified in 
virions of infectious bronchitis virus (IBV) and TGEV 
(Liu & Inglis, 1991, Godet ef al., 1992). In addition to 
these structural proteins, HCV-OC43 possesses several 
open reading frames (ORF) that could encode putative 
nonstructural (ns) proteins (Mounir & Talbot, 19935). 

The OC43 strain of the human respiratory coronavirus 
and the bovine enteric coronavirus are antigenically very 
similar since no polyclonal serum can distinguish 
between them (Hogue et al., 1984). Indeed, the predicted 
amino acid sequences of all described structural and 
nonstructural proteins of these two viruses show over 
91% identity (Mounir et al., 1994). The major genomic 
difference observed so far between HCV-OC43 and BCV 
is the absence on HCV-OC43 of two ORFs that could 
encode putative nonstructural proteins of 49 and 
4:8 kDa in BCV (Mounir & Talbot, 19935; Abraham et 
al., 1990). Further sequence analysis of the BCV genome 
has revealed an additional ORF located upstream of 
mRNA 2 (Cox et al., 1989). This gene was demonstrated 
to be expressed in BCV-infected cells and to encode a 
nonstructural phosphoprotein (ns2) of 32 kDa (Cox ef 
al., 1991). A similar protein has been found in MHV- 
infected cells (Luytjes et al., 1988; Bredenbeek ef al., 
1990). In the present study, we have cloned, sequenced 
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TT GTGAGCGAAGTTGCGTGCGTTGCATCCCGCTTCACTGATCTCTTGTTAGATCTTTTTGTAATCTAAACTTTAAAAATG GCT GTC 86 


GCT TAT GCA GAC AAG CCT AAT CAT TTT ATC AAT TTT 
A 2 A D K P N H F T: N F 


TGT TTA GAT GTG GTA CGT AAA GAG CTA GAA GAA GGC 


AAG CTA TCG TAC CAT TAT GAA AAC AAT TCT TGG TTC 


K L Ss Y H XY E N N s Ww F 


CGT AAG GCT TGT CAA AAT TTA GAT TGT AAT TGT TTG 
N L D Cc N c L 


R K A c Q 


M A Vv 3 


CCA CTT ACC CAT TTT CAG GGT TTT GTG TTA AAT 155 


L T H F Q G F Vv L N 26 


GAT TGT AAA ATA CAA ACA GCG CCA CAC ATT 224 


AGT GTT GAT GTC GCT ATT CAA GAA GTT ATT 293 
Ss Vv DB Vv A I Q E Vv I 72 


P 

GTG 

Vv D c K I Q T A P H I 49 
AAA 

K 


TTT GAG AAT CCT CAC ATC CTA GGA AGA TGC ATA 362 


H bE L G R Cc I 95 


TAC ATT CGT GAT AAA GGT TGT 431 
Y I R D K G Cc 118 


CTC ACG GAT GCA GCA CTG TCC 500 


I A L T D A A L s 141 
e 

AAT 

N Y I T I N Pp Ss Ss 164 

AAA GAT TTT TAT GAA ACC ATA GTT AGT 638 

K D F Y E p T: Vv Ss 187 


GCT ATG ATT ATG GGT TAT TGT 707 


P 
AAT 
N 
CAA 
Q 
TAT AAA ATC ACC ATC AAC CCC TCA TCA 569 
K 
GGT 
G 
CTA 
L A M I M G Y Cc 210 


GAT CTT CCC GAG AAT GAT GAT GAT GCT TGG TTT 776 


L P E N D D D A Ww F 233 


TTC CGA CAT GTC TAC AGG AAA AGT TTT CAT TTC 845 


R H Vv Y R K s F H F 256 


GGG TTT TAT GAA TCT CCA GTT GAA GAA GAC TAA 914 


F Y E s P Vv E E D ‘ 278 


Fig. 1. Complete nucleotide sequence of the ns2 protein gene of HCV-OC43 and its deduced amino acid sequence. The leader sequence 
is underlined. The intergenic consensus sequence is doubly underlined. Potential phosphorylation sites (@) are indicated. An asterisk 


marks the termination codon. 


and expressed in Escherichia coli the corresponding 
region of HCV-OC43 and identified a gene that could 
encode a protein of 32:-2kDa. With monospecific 
antisera, we have also detected the expression of this 
protein in infected cells. 

The origin and cultivation of HRT-18 cells and the 
OC43 strain of HCV has been described previously 
(Mounir & Talbot, 1992). The nucleotide sequence 
corresponding to the HCV-OC43 ns2 gene was obtained 
by RT-PCR on viral RNAs. The antisense primer 5’ 
TTAGTCTTCTTCAACTGGAG 3’, complementary to 
nucleotides 895 to 914 (numbering is from Fig. 1) of the 
BCV ns2 gene (Cox et al., 1989), was used for cDNA 
synthesis. The sense primer 5’ TTGTGAGCGAAGTT- 
GCGT 3’, homologous to a portion of the HCV-OC43 
leader sequence (Kamahora et al., 1989; Fig. 1, nucleo- 
tides 1 to 18} and the antisense primer were used for 
cDNA amplification. PCR products were cloned into the 
pCRII TA cloning vector (Invitrogen). Nucleotide 
sequencing was performed on both strands of at least 
two clones of two PCR products by the dideoxy- 
nucleotide chain termination method (Sanger et al., 
1977) using T7 DNA polymerase (Pharmacia) and [a- 
35S]dATP (Amersham). Two internal primers were used 


to complete the nucleotide sequence: 5’ GCTATTCA- 
AGAAGTT 3’, homologous to nucleotides 276 to 290, 
and 5’ CGCCTTCTTCTAGCTCT 3’, complementary 
to nucleotides 728 to 744 (numbering is from Fig. 1). 
Sequence analyses, including hydropathy plots (Kyte & 
Doolittle, 1982), were performed with the MacVector 3.5 
(International Biotechnologies, Intersciences Inc.) and 
GeneWorks 2.0 (IntelliGenetics Inc.) sequence analysis 
programs. 

As shown in Fig. 1, the region upstream of the HE 
gene of HCV-OC43 contains an ORF of 278 amino acids 
that could encode a protein with an estimated molecular 
mass of 32:2 kDa. The consensus intergenic sequence 
UCUAAAC observed upstream of many other corona- 
virus genes (Cox et al., 1989; Shieh et al., 1987) was 
found 15 nucleotides upstream of the initiation codon. 
Six potential phosphorylation sites (Fig. 1!) but no 
potential N-glycosylation sites were found in this 
predicted protein. Previous studies on the BCV ns2 
protein revealed the presence of the phosphorylated 
amino acid residues serine and threonine (Cox ef al., 
1991). Interestingly, four of the six potential phos- 
phorylation sites are shared by BCV and HCV-0C43, 
which is consistent with the possibility that the HCV- 
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HCV-0C43 MA~-VAYADKPNHF INF PLTHFOGFVLNYKGLOFOILDEGVDCKIOTAPHISLTMLDIQPEDYKSVDVAI 68 
BCV pea ee ee Oe ney, Oo Re ans Mw es ig aiew dice eee tea ee Bee 2.9 4-4 Ky Ry eens 68 
MHV-JHM is BRM Fcc a aaa wed 3 AQ.S..MGK.LK..S.LVEM.L...L.KV..V.ITL...KADQ..Q.EF.. 70 
MHV-A59 mae NERS cep alee eee ¥ AQ.S..MGK.LK..S.LVEM.L...L.K...V.ITL...KADQ..Q.EF.. 66 
HCV-0C43 QEVIDDMHWGDGFQIKFENPHILGRCIVLDVKGVEELHDDLVNY IRDKGCVADQSRKWIGHCTIAQLTDA 138 
BOY 9° 0 eu hwade aces Bios 520% FN yas bee ie PR I SENS ee ie ae a ae a ate aS ae a Gide Mares 138 
MHV-JHM ..1 LAAYE.-D.V.D...M....L....-. Pewee Bs DELL RR. Te ae oe Q..P...V..FPDEE 139 
MHV-A59 ..I...LAAYE.-D.V.D...M....L....R.F....E.1.EIL.RR..T..... H..P...V..PDEG 135 
HCV-0C43 ALSIKENVDFINSMQFNYKITINPSSPARLEIVKLGAEKKDGFYETIVSHWMGIRFEYTSPTDKLAMIMG 208 
BCVA Gh A YEA SS BEERS Soe Caw ee NS EO RR oe ee BR Rw Re OLS See NP Laos eek ens 208 
MHV-JHM K-E...-MQ.YFKLP.-.LKHN.LLTD....L..1.SS.V....CSEL.I.C.E.LC.KP..P.FSD.F. 206 
MHV-A59 K-RN.R-NE.YHKEP.-.LKHN.LLTD.G..L..I.SS.I....CSEL.V.C.E.LC.KP..P.FSD.F. 202 
HCV-0C43 YCCLDVVRKELEEGDLPENDDDAWFKLSYHYENNSWFFRHVYRKSFHFRKACQNLDCNCLGFYESPVEED* 278 
BCV WS oe eee SP Seite one, Guba eo aleve, wid Sud ie rwllgre rae pegire telser ar cars VOL SVSEV = 53% one eee OS * 277 
MHV-JHM .. IDKI.GD..1I....PD.EE..AE..... ORGY ceca HDN.IV..TV.RMKG.M.* 265 
MHV-A59 ...IDKI.GD..1I...-.QD.EE..AE..... OR PY exe HDN.IV. .TV.RMKG.M.* 261 
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Fig. 2. Amino acid comparison between the ns2 proteins of human, bovine (Cox et al., 1989) and murine (Luytjes ef al., 1988; 
Bredenbeek ef al., 1990) coronaviruses by alignment for maximum identity. Dots indicate residues identical to those of HCV-OC43. 
Hyphens represent gaps introduced into the sequence to maximize alignment. 


OC43 ns2 protein is phosphorylated like its BCV 
counterpart. 

As shown in Fig. 2, the putative HCV-OC43 ns2 
protein shares 92% amino acid sequence identity with its 
BCV counterpart. Interestingly, a stretch of nine con- 
secutive amino acids near the C terminus of the predicted 
protein is completely different between the two viruses. 
The presence of one additional nucleotide on HCV- 
OC43 (at position 834, Fig. 1) is responsible for this 
modification. Twenty-four residues downstream (at 
position 858, Fig. 1), the original reading frame of the 
ns2 gene is restored by insertion of two nucleotides. 
Hydropathy plots for these two ns2 proteins revealed 
that this stretch of nine amino acids is very hydrophilic 
in HCV-OC43 but highly hydrophobic in BCV (data not 
shown). Thus, such a drastic change in this region of the 
protein could affect the secondary and tertiary structure 
of the ns2 protein and consequently modify its biological 
function. The identity levels between the ns2 proteins of 
HCV-OC43 versus those of MHV-A59 and MHV-JHM 
are 49% and 51%, respectively (Fig. 2). 

The ns2 proteins of MHV-A59, MHV-JHM and BCV 
possess three nucleotide binding domains (Cox et al., 
1991; Luytjes et al., 1988). Even though the HCV-OC43 
ns2 protein possesses a similar nucleotide binding 
domain, it is unlikely that this protein could interact with 
RNA since its isoelectric point is 4-8. Indeed, the negative 
charge of the ns2 protein and RNA would result in 
mutual repulsion of these molecules at physiological pH. 

To analyse the expression of the HCV-OC43 ns2 
protein in infected cells, immunofiuorescence assays were 
performed on HRT-18 cells infected with HCV-OC43 at 
an m.o.i. of 0-05. At 66h post infection, the cells were 
fixed with acetone and incubated first with a mouse 


antiserum specific for a recombinant form of the BCV 
ns2 protein (a kind gift of Dr Pascal Boireau, Centre 
national d’études vétérinaires et alimentaires, Maisons- 
Alfort, France), then with fluorescein-conjugated F(ab’), 
fragments of goat anti-mouse antibody (Cappel Research 
Products, Organon Technika Inc.) and observed under 
UV light. As shown in Fig. 3, specific fluorescence was 
distributed uniformly around the nucleus, which is 
consistent with the expression of the ns2 protein in the 
cytoplasm (Fig. 3). As expected, uninfected cells showed 
no fluorescence (Fig. 3a). Negative and positive controls 
were performed on infected cells with a preimmune 
serum (Fig. 3c) and with a murine monoclonal antibody 
directed against the HEV nucleocapsid protein (a kind 
gift of Dr Serge Dea, Institut Armand-Frappier, Laval, 
Québec, Canada) (Fig. 3d). Similar results were obtained 
with a monospecific rabbit antiserum produced against 
the HCV-OC43 ns2 protein. To produce this antiserum, 
the HCV-OC43 ns2 gene was reamplified by PCR with 
sense and antisense primers to which BamHI sites were 
added. The PCR product was purified, digested with 
BamHI and ligated into the BamHI-digested pMAL-c2 
vector (New England Biolabs). The resulting plasmid 
was introduced into E. coli. Clones containing the insert 
were sequenced to verify the integrity of the ns2 gene. 
The production of the maltose-binding protein (MBP) 
fused to the N terminus of the ns2 protein was induced 
in selected clones with 0-3 mm-isopropyl f-thiogalactos- 
ide. The MBP-ns?2 fusion protein was purified by affinity 
chromatography on an amylose resin (New England 
Biolabs) for production of a monospecific rabbit anti- 
serum. Approximately 300 pg of the purified fusion 
protein was injected subcutaneously into a 3 kg New 
Zealand White female rabbit. The antigen was first 
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Fig. 3. Indirect immunofluorescence staining of HCV-OC43-infected cells. At 66 h after infection. cells were fixed and stained with 
primary mouse antibodies, followed by FITC-conjugated goat anti-mouse secondary antibody. (a) Uninfected cells stained with an ns2- 
specific mouse antiserum; (4) infected cells stained with an ns2-specific mouse antiserum; (c) infected cells stained with a preimmune 
mouse antiserum; (d) infected cells stained with an N-specific murine monoclonal antibody. 


administered with complete Freund’s adjuvant and 
subsequent injections (every 2 weeks) used incomplete 
Freund’s adjuvant. The antiserum was collected when 
optimal levels of specific antibodies to the MBP-ns2 
fusion protein were reached; this required seven in- 
jections, an indication of the low immunogenicity of this 
protein, at least in rabbits. 

In conclusion, we have shown that HCV-OC43 does 
express a protein of 32 kDa encoded by mRNA 2 (ns2), 
as was previously observed with BCV (Cox et al., 1991) 
and MHV (Bredenbeek ef a/., 1990; Zoltick et al., 1990). 
It is noteworthy that the ns2 gene has so far only been 
observed in coronaviruses of the same antigenic group, 
in which an HE gene is also present, except for MHV- 
AS59, which does nevertheless contain a pseudogene 
(Luytjes et al., 1988; Shieh et al., 1989). Therefore, 
possible physical and/or functional interactions between 
the ns2 and HE proteins or genes need to be investigated. 

This completes the characterization of the region 
downstream of the polymerase gene of HCV-OC43. The 
polymerase gene has been shown in other coronaviruses 
to encode several putative nonstructural polypeptides 
that have yet to be fully characterized (Boursnell et al., 
1987; Denison et al., 1991; Herold et al., 1993). The 
genomic organization of this human coronavirus is thus 


as follows: 5’ pol/ns2/HE/S/ns4/sM/ns5-1/M/N 3’. 
The high amino acid identities (91 to 97%) observed 
between HCV-OC43 and BCV structural and non- 
structural proteins suggests that the two viruses have 
diverged only recently. However, they have developed 
different pathogenic properties, including an apparent 
lack of replication of HCV-OC43 in a bovine host (P. J. 
Talbot & L. A. Babiuk, unpublished observations). The 
previously reported genomic deletion in HCV-OC43 
compared to BCV (Mounir & Talbot, 19935; Abraham 
et al., 1990), remains the only major structural difference 
between the two viruses, although its relevance to the 
biology of these structurally related viruses, one a 
recognized respiratory pathogen (HCV-OC43) and the 
other a recognized enteric pathogen (BCV), remains to 
be established. Genetic engineering of the HCV-OC43 or 
BCV genome will most likely be necessary for a definitive 
characterization of the molecular basis of virus tropism. 
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